Reinforcement Learning with PI2 Algorithm to Generate Motor Primitives of a Complex Snake-Like Robot

نویسنده

  • Sromona Chatterjee
چکیده

In this thesis work a policy improvement algorithm called Policy Improvement with Path Integrals (PI2) is used to generate goal-directed locomotion of a complex snake-like robot with screw-drive units. PI2 is numerically simple and has an ability to deal with high dimensional systems. Here, PI2 is used to find proper locomotion control parameters, like joint angles and screw-drive unit velocities, of the robot. The learning process is achieved in simulation and the learned parameters are successfully transferred to the real robot. As a result the robot can locomote towards a given goal. Furthermore, as PI2 generates proper control parameter sets for different goals and robot body shapes (like, straight-line, zigzag, arc), a considerable and meaningful repertoire of robot behaviors is obtained in this way. Out of this, certain learned parameters sets are selected as motor primitives to generalize goal-directed locomotion generation in this work and generate new behaviors online. By selecting different primitives and properly chaining or combining them along with parameter interpolation, the robot can successfully handle complex tasks like, reaching a single goal or multiple goals while avoiding obstacles and compensating to a change in its body shape. As a result, the robot can successfully locomote towards a given goal as well as handle a complex environment or an unknown situation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Reinforcement Learning Approach for Goal-directed Locomotion of a Complex Snake-Like Robot

Snake-like robots conventionally use undulation movements for locomotion [1], [2]. In contrast, a nonstandard snake-like robot having four screw-drive units which are connected serially by three active joints [3] uses propulsion by rotating its screw-drive units to move. Employing this screw-drive mechanism, undulation is not required for its locomotion. This helps it to move in narrow spaces. ...

متن کامل

Reinforcement learning of motor skills with policy gradients

Autonomous learning is one of the hallmarks of human and animal behavior, and understanding the principles of learning will be crucial in order to achieve true autonomy in advanced machines like humanoid robots. In this paper, we examine learning of complex motor skills with human-like limbs. While supervised learning can offer useful tools for bootstrapping behavior, e.g., by learning from dem...

متن کامل

Policy Search for Imitation Learning

Efficient motion planning and possibilities for non-experts to teach new motion primitives are key components for a new generation of robotic systems. In order to be applicable beyond the well-defined context of laboratories and the fixed settings of industrial factories, those machines have to be easily programmable, adapt to dynamic environments and learn and acquire new skills autonomously. ...

متن کامل

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

متن کامل

Learning Motor Skills: From Algorithms to Robot Experiments

Ever since the word “robot” was introduced to the English language by Karel Čapek’s play “Rossum’s Universal Robots” in 1921, robots have been expected to become part of our daily lives. In recent years, robots such as autonomous vacuum cleaners, lawn mowers, and window cleaners, as well as a huge number of toys have been made commercially available. However, a lot of additional research is req...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014